LSD Docs

home *** CD-ROM | disk | FTP | other *** search

/ LSD Docs / LSD Docs.iso / FILEZ / lsd37.dms / lsd37.adf / Migraph.doc.pp / Migraph.doc

Wrap

Text File | 1978-01-03 | 74KB | 1,966 lines

MIGRAPH OCR OPTICAL CHARACTER RECOGNITION SOFTWARE Regards to the best - Rygar, Scooter, Munchie, Flex, 2-Tuff, H.I.M, Skid Row, Razor Blade (these docs are dedicated to you eheh) Check out these cool Alliance Boards ------------------------------------------------------------ Unknown Pleasures +44-0-204-299-17 Razor Blade The Courts of Chaos 501-470-3751 Baser Evil ------------------------------------------------------------ CONTENTS Welcome.................................................1 How to use this manual.............................2 What is OCR?.......................................2 The Main Steps in OCR Processing...........3 What Migraph OCR Can Do....................4 Pretrained Typefaces...............................5 System Requirements................................5 Installing Migraph OCR.............................5 Amiga Installation.........................6 Atari Installation.........................8 A Look Around the Screen................................9 Quick Start............................................11 Scanning...............................................18 Before Scanning...................................17 Document Quality..........................17 Contrast Adjustment.......................17 Alignment.................................18 Which DPI?................................19 Scanning Documents with a Hand Scanner............19 Text layout...............................20 Scanning from within Mighraph OCR.........22 Preview................................................27 Viewing an Image..................................27 - Page i - Defining Regions..................................28 Creating a Rectangular Clip Box...........28 Creating a Polyline Clip Box..............29 Defining Regions on a Full Page...........30 Altering a clip Box...............................31 Sizing the Clip Box.......................31 Combining Clip Boxes......................32 Moving a Clip Box.........................33 Deleting a Clip Box.......................33 Sorting Text Regions......................33 OCR Processing.........................................35 Automatic OCR.....................................35 Interactive Learning..............................36 Processing a File.........................36 Control Panel..........................................39 Input.............................................40 Using an Existing Image File..............40 Direct Scanning...........................40 Document..........................................41 Language..................................42 Dictionaries..............................42 Text......................................43 Interactive Learning......................43 Output............................................44 Text......................................44 Cache.............................................45 Get Image, OK and Cancel..........................46 Menu Commands..........................................47 Desk/MI_OCR.......................................47 Migraph OCR Info..........................47 Palette...................................47 File..............................................47 Load File.................................48 Save Region...............................48 - Page ii - Default Settings..........................48 Load......................................48 Save......................................48 Save As...................................49 Current...................................49 Quit......................................49 Scan..............................................50 Start Scan................................50 Settings..................................50 Clip..............................................51 Undo......................................51 Delete....................................51 Select All................................51 Sort Order................................51 Erase Graphic.............................52 # Text Regions............................52 Process...........................................52 Control Panel.............................52 OCR.......................................52 Rotate....................................53 Invert....................................53 Flip......................................53 View..............................................53 Zoom Full.................................53 100%......................................54 200%......................................54 Show Order................................54 Locator...........................................54 Index..................................................55 - Page iii - WELCOME Migraph, Inc. is proud to present you with Migraph OCR (Optical Character Recognition), a powerful, intelligent, full featured text reading application. Migraph OCR offers one of the fastest and least expensive means of converting printed text into an Ascii text file which can be loaded into word processing and desktop publishing programs. While it is convenient to scan directly from within Migraph OCR using one of the compatible hand scanners, you do not have to own a scanner to use Migraph OCR, you need only have access to one. Since Migraph OCR will load scanned TIFF, IMG (Atari), and IFF (Amiga) files, you may use any scanner that saves these file formats. You may even use a scanner on another computer (a PC for instance), as long as it saves one of the compatible monochrome formats. You will find Migraph OCR easy to use. Optical Character Recognition is one of those applications where all the hard work is done behind the scenes. User interaction is normally minimal, depending on the quality of the scanned text being processed. Like other Migraph programs, Migraph OCR is user friendly, employing icons, dropdown menus, and dialog boxes. Thank you for purchasing Migraph OCR. We know you'll find it to be one of your most useful applications. Before installing this program, please take the time to fill out the registration card and mail it in. Being a registered owner entitles you to receive free technical support and the Migraph newsletter "Headlines", which provides news on updates, upgrades, new products, etc. - Page 1 - HOW TO USE THIS MANUAL This manual is for use on both Amiga and Atari systems. Except where noted, all information applies to both systems. The manual is divided into several chapters. After reading this chapter and installing Migraph OCR, got to the Quick Start chapter which provides a good introduction to the OCR process and lets you see how it works. The remaining chapters have detailed information on scanning, preview, and OCR processing. Hints are found throughout the manual in the appropriate sections. Note: If you own one of the compatible hand scanners, we recommend that you read the section on direct scanning before starting. WHAT IS OCR? OCR stands for Optical Character Recognition, a process whereby printed text is electronically converted into computer readable files i.e. an Ascii file. Rather than typing text into the computer by hand, an optical scanner is used to scan the text. The scanner's job is to convert the image into a map of black and white dots (pixels) that represent the original image. When scanned text appears on your monitor screen, an "A" looks like an "A", but to the computer it is just a series of black pixels grouped together. OCR software is the vital link that identifies the group of pixels as a particular character. In a simplistic view, the program basically says, this is an "a", this is a "b" and so on. Previously, OCR programs were raster based. This meant that the program relied upon a database of stored pixel images to identify the scanned text. Unfortunately, if the scanned character did not exactly match the size and shape of a character in the database, it was not recognized. The next step in OCR was a trainable OCR. With a trainable program, when an unrecognized character appeared, you could correctly identify the character for the program and thereby "train" it to recognize the character - Page 2 - in the future. Depending on the quality of the OCR program though, you might have to train a program on a character 5, 10, 15 times or more before it would recognize the character automatically. Then came Omnifont technology. Omnifont technology uses a mathematical definition to identify the character instead of a raster image. Not only is it quicker and with a higher accuracy rate, but the program itself can be considerably smaller in size. Using mathematical definitions also means that the program has a greater chance of recognizing similar typefaces, even though it may not have been trained on them. THE MAIN STEPS IN OCR PROCESSING There are four main steps in OCR processing: 1. Line segmentation 2. Word and character segmentation 3. Character recognition. 4. Text output Line Segmentation consists of slicing a page into its different lines. Line skew and interspacing are also analyzed in this step. The most difficult documents are those with a small interline spacing and a significant skew. Word and Character Segmentation isolates one word from another and then seperates each letter from another. This step is easier when the text is monspaced, meaning all the letters are the same width. With proportional text where each letter's width is based on its shape, the process takes a little longer. The most difficult case is where characters are of a proportional font and are randomly touching. Character Recognition involves extracting the characteristics of each isolated shape and giving it a name. Text Output is the conversion of the recognized text into a file which can be imported into your word processor or desktop publishing application. - Page 3 - WHAT MIGRAPH OCR CAN DO Migraph OCR is pretrained on twenty typefaces and can be trained to recognize many more. It recognizes both American and European character sets as well as proprtional and monspaced fonts, typeset, typewritten, laser printed, and NLQ dot matrix quality text. Migraph OCR can read type-written and non-kerned proportional italics fonts. Other italics may be recognized with a little slower validation process and may require more user interaction. The character size range is 10-18 points which covers most of the font sizes used for body text. However smaller sizes can be recognized when scanned at 400 dpi. Characters of mixed sizes don't pose a problem as long as there is no possibility of confusing a capital of one size with the lower case of another size. If there is the possibility of confusion, the Migraph OCR decision regarding uppercase/lowercase may be incorrect. Using Migraph OCR for titles is not recommended since they are usually in large point sizes and may be easily added in a word processor or desktop publishing program (DTP). In addition to having Omnifont technology, Migraph OCR has intelligence. Lexicons for four languages are provided: English, French, German, and Dutch. Lexicons are linguistic databases which provide additional information, like syntax, to aid in character recognition. By using the information in the lexicons, the program is actually able to make decisions regarding characters, rather than just presenting them to you for identification. While Migraph OCR is a powerful program, it cannot do everything. It will not read parts of text in dense or kerned italics typefaces and it may have trouble reading scans that are too light or too dark, or that are skewed. - Page 4 - PRETRAINED TYPEFACES Migraph OCR has been pretrained on these typefaces: Artisian, Bookman, Brougham, Caroll Pica, Courier, Courier Italic, Delegate, Elite Modern, Helvetica, Herald Elite, Letter Gothic, Lori, Lubalin, OCRB, Pica, Prestige, Elite, Prestige Italic, Prestige Pica, Times, Titan, and Title. SYSTEM REQUIRMENTS Migraph OCR requires at least 2MB RAM and a hard disk. As with any program that scans graphics, the more memory you have, the better. A full page scan at 300 dpi normally takes up about 1MB. Migraph OCR uses hard drive caching, which means that when you run out of RAM, the remaining file information will be sent to your hard disk. While this enables you to work with larger files, it is slower than using RAM memory. - Page 5 - INSTALLING MIGRAPH OCR Note: Before installing Migraph OCR, insure that you have at least 2MB available on one of your hard drive partitions. Amiga Installation To install Migraph OCR on an Amiga system, follow these directions: [1] Boot-up your computer, if it is not already on. [2] Insert the Migraph OCR Master Disk into your floppy drive. [3] Double-click on the Migraph_OCR floppy disk icon. [4] Double-click on the Install_OCR icon. Follow the prompts that appear on your screen. When done, Migraph OCR will be installed on your hard drive and ready to run. To run Migraph OCR, double-click on the Migraph_OCR icon within the Migraph_OCR drawer. Running Migraph OCR from the CLI To start Migraph OCR from the CLI, type the following line at the CLI prompt: Run Migraph_OCR Adjusting the Screen Palette Migraph OCR for the Amiga, has a palette tool which allows you to modify - Page 6 - its screen colors by altering their RGB values. This is useful for reducing screen flicker when you want to run Interlace mode. When running Migraph OCR on a custom screen, the palette alters only Migraph OCR's colors. However, when you run Migraph OCR as a Workbench application, the palette affects the colors of the whole Workbench. This is important to remember, especially should you decide to save the colors as the default. To use the palette tool, follow these directions: [1] Select Palette from the MI_OCR dropdown menu. A dialog box will appear. If the value for planes is set to two, four colors appear; if the planes value is set at one, two colors will appear. The palette will display whatever colors you have saved as the default. However, the very first time you use the palette, it will display the default colors for the workbench: black, white, blue, and grey (2.x) or blue, black, orange, and white (1.3). [2] Select the color whose RGB values you wish to alter by clicking on it. To alter the RGB values move the slider for each value. As you move the slider, the screen and window colors will change. To reduce screen flicker in Interlace mode, choose colors with little contrast between them. For instance, instead of black and white, you might choose grey and black, or red and black. It will be immediately apparent when altering the colors, whether the flicker is getting better or worse. You may need to change the RGB value of each plane color to get maximum results. If, after changing the RGB values, you decide you would prefer the original place values, click on DEFAULT. The palette will then display the original workbench colors. To reset the values back to the previously selected palette, click on Reset. [3] After the values have been changed, you may click OK to save - Page 7 - the values for the current sesion, or click Cancel to quit the palette and reset the values as they originally were before you entered the palette dialog box. [4] If you have selected a new set of values that you would like to keep as the default from the Touch-Up custom screen, select Defaults from the File menu. The Defaults dialog box will appear. Click on Save. Thereafter, these colors will be used every time you run Touch-Up. - Page 8 - A LOOK AROUND THE SCREEN Before using the tutorial in the next chapter, you may want to familiarize yourself with the elements of Migrap OCR's screen. This section provides a brief description of each of the elements found on the screen. Boot-up your computer (if you have not already done so), then start Migraph OCR by double-clicking on the Migraph OCR program icon. You will initially see a dialog box with the copyright information, and then the program screen will appear which looks like this: - Page 9 - Close Box - Use the close box to quit Migraph OCR. Menus - These menus offer access to all the commands you need to run Migraph OCR. Slider Bars - The slider bars located on the right and bottom of the screen allow you to access all parts of an image that has been scanned or imported into the program. Top Icon Pad - This pad contains five icons which are used in the preview mode before OCR processing begins. They allow you to designate text areas using boxes or polygons which can then be edited. You may also select a graphic area you wish to save. The selected icon is inverted. Bottom Icon Pad - This icon pad contains the tools and commands you'll use to import or scan images and perform OCR on them. The center icon flips between a scanner and disk depending on whether you have opted to load a file or scan directly. Zoom Levels - Migraph OCR provides three zoom levels at which to view an image, while the black box represents the total area of the scanned or imported image, while the black box represents the portion of the image displayed on the screen. To use the Locator move the mouse over the black box, and drag the box to a new location, then release the mouse button. A new part of the image will then be displayed. - Page 10 - QUICK START The easiest way to become familiar with a program is to begin using it. This chapter leads you through a step by step tutorial using a sample scanned file which can be found on your Master Disk. Even if you own a hand scanner that allows you to scan directly, we recommend that you follow this tutorial to help you understand how the OCR process works. Note: The directions in this chapter assume that you have already installed Migraph OCR on your hard drive. [1] Double-click on the Migraph OCR program icon. The program will run and the following screen will appear. - Page 11 - [2] Click on the Control Panel icon. The following Dialog box will appear: The Control Panel is the dialog box that you will use to define all the parameters for the program. You can see that it lets you specify input, output, document characteristics, dictionaries, graphic formats, and cache path settings. [3] Under Input, click on the File button (if it is not already selected). [4] Under Document, click on New under the Dictionary button. This specifies that you will be creating a new font directory. [5] Click on the Dictionary button. The File Selector will appear. Enter the path and name for the dictionary, for instance TEMP.ALD. - Page 12 - This dictionary will hold any new characters that you train Migraph OCR to recognize during this tutorial. When you process a file, the program will use any dictionary you specify, in this case TEMP.ALD, in addition to the built-in dictionary. [6] Under Output, click on Text Out. The File Selector will appear. Enter the path and file name you wish to save the sample under, for example TEST.TXT. If you do not specify a name, the Ascii file will be automatically named MIOCR.TXT. [7] Click on the Get Image button. The file slecetor will appear. Select the file DEMO.IFF and click OK. The file will load ad be displayed on your screen. [8] Click on the Zoom Full (F) icon. [9] Click on the Text Icon. This designates the area you will be defining as a text region. [10] Click on the Clip Box icon. Move the cursor to the upper left hand corner of the top paragraph and click the left mouse button once. Move the mouse until the clip box is the desired size and click the left mouse button again. This creates a clip box around the text. - Page 13 - [11] Click on the Point Mode icon. Point mode allows you to select, size and move clip boxes. To select a clip box, click inside it. To select more than one region, click inside any additional clip boxes you wish to select. To deselect any clip box, click in it again. To size a clip box, first select it, then move the mouse over one of the polymarkers, depress the left mouse button and move the mouse (also known as 'dragging' the mouse). When the clip box is the correct size, release the mouse button. To move a clip box, first select it, then place the mouse within the box, and drag the mouse. When the box is in the desired location, release the mouse button. [12] Click on the OCR icon. Several status bars will be displayed on the screen as Migraph OCR processes the image. After a short while, a dialog box will appear that looks like this: - Page 14 - This is the Interactive Learning dialog box which is the heart of Migraph OCR. This is where you will make the decision to accept or reject characters, and where you will train the program on new characters. When it processs text, Migraph OCR goes through several decision phases where it compares the characters in the file to the trained characters in the built-in dictionary and the user dictionary (if any) and tries to identify every character. When it has recognized every character on its own that it can, it will then present the unrecognized characters to you for identification. BOX #1 displays the text that has been processed (i.e. recognized). When Migraph OCR comes across a character it cannot recognize, the word that contains it will be inverted. The program will now stop and wait for action from you. BOX #2 displays the word which contains the unrecognized character. The unrecognized character will be in bold on monochrome systems, or the opposite color on color systems. BOX #3 displays the program's best guess at the correct identity of the character. If a single letter guess is incorrect, you can simply type the correct character. If theer should actually be several characters, use the backspace key to delete the current character and the necessary amount of leading spaces and then enter the correct character(s). After a correct guess or a correction, you will use one of the buttons below to specify an action regarding the character. Each of the buttons offers you a different choice as to how to handle the character in question. Here is a brief explanation of the choices. TRAIN: Use thsi option to train the program on new characters. The program will form a mathematical definition for the character and store it in the specified user dictionary. DO NOT use this option for defaced or ill formed characters that you would not want in the dictionary. Note: When the box next to the Accept ot Train button is selected (black), it is the default button. As the default button, you may use the Return key - Page 15 - instead if using the mouse. Click on your choice to select it. ACCEPT: Use Accept when you wish to accept/correct the character in box #3 and have that character placed in the file, but NOT in the dictionary. The program will not be trained on the accepted character. CONFIRM: In some fonts, the letter i and the number 1 are identical, as are the letters o and the number 0. When this is the case, you can use the confirm key to correctly specify which is which. Thereafter, the program will not present these characters to you for identification. UNDO: Undo allows you to go back and change choices you made regarding the last fifteen (15) unrecognized characters. DELETE: Use Delete to delete unwanted characters, for example dirt specks that were picked up by the scanner. AUTO: This option processes the file or remainder of the file without interactive learning. The program replaces unrecognized characters in the text file with its best guess. If it cannot make a guess (such as when characters are joined) it will replace the unrecognized character(s) with the @ sign. STOP: Use the Stop button when you wish to stop processing before the file is done. The program will save whatever text has been processed and return you to the previous screen. [13] Process the file trying different choices. After the image is processed, the text will be saved to the file you specified earlier, and you will be returned to the main screen. [14] To view/edit the text file, ext Migraph OCR by selecting Quit from the File menu; then run your word processor and load the new text file. - Page 16 - SCANNING Note: This chapter contains information on scanning that is useful to all scanner owners. Regardless of which type of scanner you own, we recommend that you read the general sections at the beggining of this chapter before scanning text for processing. BEFORE SCANNING This section contains guidelines for scanning documents. Document Quality One of the first things to look at when considering a document for OCR processing is the quality of the printed text in the document. Is it sharp, clear, dark print, easily readable? Or is it poor quality, blurry letters, hard to read? The quality of the original affects how accurate Migraph OCR will be when processing the scan of the document. Always try to have the best quality original possible. Hint: If the original is particularly light, make a photocopy and scan the photocopy. Contrast Adjustment Scanner contrast is one of the most important factors which can affect recognition accuracy. Scans that are either too light or too dark may cause - Page 17 - the OCR to recognize fewer characters, which in turn means more interaction by you to determine character identity. Scans that are too light may cause parts of characters to drop out. For instance an "o" may look like (). Scans that are too dark cause the letters to thicken, and the openings in characters like "a" and "e" tend to close up. The space between characters also decreases making it more difficult for the program to discern individual characters. Using the contrast control on your scanner allows you to increase the quality of poor originals, and to get the best quality from good originals. Since the quality of documents does vary, we recommend that your experiment with the contrast level before you process the scan. If the contrast is too light or dark, rescan the image before processing. Alignment After contrast, the next most important item is alignment of the document to the scanner. If the scanned image is skewed, it will affect the accuracy of the OCR because characters from the first line may cross over and enter the area of the second line, etc. This interfers with the OCR's processing step of line segmentation. - Page 18 - Should your scan be skewed, we recommend that you rescan the image. If you are using a hand scanner, try using a book as a straight edge, or a scanning device, like the Migraph Scanning Tray, to get a straighter scan. Which DPI? When you wish to scan text between 10-18 points, we recommend that you scan at 300 dots per inch. For text less than 10 point we recommend you scan at 400 DPI, as this will provide more dots and a better image. For text over 18pt, use 200 DPI. SCANNING DOCUMENTS WITH A HAND SCANNER You may scan text and graphics directly from within Migraph OCR using the Migraph, AlphaData, and Golden Image hand scanners. Direct scanning is preferable because it saves time since you will not have to spend time saving and loading files that will only be deleted later. If you own one of the compatible hand scanners, we recommend that you always scan directly from within Migraph OCR. If you must scan outside of Migraph OCR with a hand scanner, scan the page in several passes, saving each pass out as a seperate file. While it is helpful to use a commercial scanning tray to get a straight scan, it is not - Page 19 - necessary to merge the passes into one file. Merging the files will not necessarily speed the processing and will take up more of your time and disk space. Note: Through the remainder of the chapter, it will be assumed that you are scanning from within Migraph OCR. If you are not, the only additional steps would be to save the scan as an IFF, IMG, or TIFF file and to load it into Migraph OCR for processing. Otherwise the instructions are identical. Text Layout How text is placed on a page and how much of it you want to scan will determine whether you scan horizontally or vertically, and how many passes you use to scan a page. If you only need to scan one column of text or a few paragraphs, you can simply scan the text and process it. However, to scan a full page of text, you must do it in several passes. The basic process of scanning a full page of text with a hand scanner, after setting all the parameters in the Control Panel, is as follows: [1] Scan the first column or portion. [2] Define the text region you wish to process with a clip box, excluding any text you do not want (such as any overlap from the second column). [3] Process the text. It will be saved to the new file you specified. [4] Re-enter the Control Panel, select Append under Output, and click OK. Thereafter, any additional text you process will be appended to that file. [5] Repeat steps 1-3 for any additional text columns/pages until the document is completely processed. - Page 20 - Hint: When you know you will be appending text, just select Append to start with when you name a file. When Migraph OCR goes to save the text to the file and does not find it, it will create the file automatically. The following illustration shows scanning the image, the image itself, and the defined text region from the scan that will be processed. Migraph OCR appends text in the order you process it. For example, on a page with three columns, you would always scan and process the first column first, the second column second, and the third column last. If you do not go in this order, the final text will not be correct. Hint: If you are scanning outside of Migraph OCR, do all the scans first, then load/process the files in order. If you have scanned the image horizontally, load the image, then select Rotate Left or Right under the Process menu to rotate the file to the correct orientation. The following illustration shows the two basic page layouts. The recommended scanning area is shows with dashed lines and the scanner shows the scanning orientation (horizontal or vertical). The numbers represent the order in which the columns should be scanned/processed. - Page 21 - Hint: When scanning horizontally, use a white sheet of paper to cover the area you do not want to scan so that you have a clean edge. You could also use this tip when scanning vertically to avoid any overlap from another column. Scanning from within Migraph OCR These instructions assume that you are in Migraph OCR, have your hand scanner properly installed, and know how to scan an image. [1] Prepare the document you wish to scan by making sure it is straight. Use some type of straight edge or guide for the scanner head to insure a straight scan. [2] Select the line art setting on the scanner head. [3] Adjust the contrast wheel if necessary. [4] Click on the Control Panel Icon. - Page 22 - The control Panel will appear. It look like this: [5] Under input, click on Scanner (if not already selected). [6] Click on the Settings button. The Scan Area dialog box will appear. - Page 23 - [7] Select the DPI you will be scanning at, by clicking on it. At the same time, make sure the DPI setting on your scanner head is set at the same value. (For this example, use the default of 300 DPI.) [8] Set the scan length by either moving the sldier bar, or by clicking on the arrows or the grey slider box. The value is displayed to the left of the slider area and will change as you move the slider. (For this example, set a length of 4 inches.) [9] Vertical scanning is the default and should already be selected. If it is not, select it by clicking on the vertical scanner icon; then click OK to exit the dialog box. [10] Under Output, click on the New button. This will create a new file that the Ascii text will be saved to. Click on the Text Out button. The File Selector will appear, select a path and name for the file and click OK. Not: If you know you will be processing several scans for one text file, select Append rather than New. This eliminates having to reenter the Control Panel. When Migraph OCR does not find the file you have named after processing the image, it will automatically create the file. [11] Click on the Get Image button. An alert will appear that states the scan will destroy the current image. Click OK to begin scanning. The scanning light will come on. Scan the image as you normally would and release the start button when done scanning. If you decide not to scan or if you release the button before scanning the full length, the scanning light will automatically turn off after 10 seconds. The scanned image will then appear on your screen. Note: Scan slowly, making sure the speed indicator light does not blink. Blinking indicates a possible loss of data that could hinder the OCR process. - Page 24 - [12] Use the locator to view other parts of the image. Move the mouse to the black box in the Locator, then drag the black box to a new area and release the mouse button. Look to see that the characters have good contrast, being neither too light nor too dark. If the contrast is not correct, rescan the image. Check the image to make sure it is not skewed. If it is, rescan it. [14] Click on the Zoom Full icon Once an image has been scanned to your satisfaction, you may select the text regions you wish to process or a graphic region you wish to save. If the graphic is a halftone, you may want to rescan the graphic portion of the image using one of the dither patterns on the scanning head. Note: Should you forget to access the Scan Area dialog box through the Input button, it will automatically appear when you select the Get Image button the first time you try to scan in a session. Once you have used this dialog box, you do not need to enter it again unless you wish to change the length or direction of the scan. - Page 25 - NOTES: - Page 26 - PREVIEW Preview is the time between scanning/loading a file and OCR processing. During this stage you may view the image and select the text regions you wish to process and any graphic regions you wish to save. A clip box is used to define these regions. While you may have multiple text regions defined on a page, you may only define one graphic region at a time. Note: If you have a scan which would normally require only a single clip box around the entire page, you do not have to define text regions. For example a typewritten letter which is in paragraphs, rather than columns. When you begin the OCR process, you will be asked if you want to process the entire page. Remember, if there is any text which was clipped by the edges of the scan, you should first define the text using a clip box. VIEWING AN IMAGE Once an image has been scanned or loaded, you may wish to view it at different zoom levels to check for skew, contrast, etc. The image will automatically be displayed at Zoom Normal. To view other parts of the image, you may use the Locator or the slider bars to the right and bottom of the image. To see a more enlarged view then Zoom 1, select one of the other zoom levels. To view the whole image, click on the Zoom Full button. Zoom ull is handy because it displays the whole scan/page making it easier to define regions. - Page 27 - DEFINING REGIONS Text regions may be defined using either rectangular or polyline clip boxes. Polyline clip boxes are very useful for irregular areas, such as when text flows around a graphic. Once graphic region may be defined/saved at a time using a rectangular clip box. Creating a Rectangular Clip Box To create a rectangle clip box, follow these directions: [1] Click on the Text icon if you will be defining a text region. OR Click on the Graphic icon if you will be defining a graphic region. [2] Click on the Clip Box icon [3] Move the mouse to the upper left corner of the region you want to define and click the left mouse button once. Move the mouse until the clip box is the desired size and click the left button again. - Page 28 - Creating a Polyline Clip Box A polyline box is used when you need to define irregular shaped text areas. Often a graphic may be located within a tet column or have text wrapped partially around it. In these cases a rectangular clip box would either not capture all of the text or include part of the graphic with the text, neither of which is satisfactory. Since the polyline clip boxes with Migraph OCR must have all line segments at 90 degree angles, it's important to think about the shape you are going to use beforehand. To create a polyline clip box, follow these instructions: [1] Select the text icon. [2] Click on the Polyline icon. [3] Move the mouse to the area you wish to define. It is usually a good idea to start at the top left of the region. [4] Click the left mouse button to start the segment. As you move the mouse a line will be drawn to the left/right of the starting point depending on where you move the mouse. The first seg- - Page 29 - ment is always horizontal. When the line is the correct size and in the correct location, click the left mouse button again. After specifying the end of a segment, the next segment will always turn at a 90 degree angle. [5] Continue step 4 until the polyline is one segment short of being complete. To close the polyline double-click the left mouse button and the last line segment will appear automatically. Note: After the first line segment is drawn, you may move the mouse in any direction in order to extend the line or to turn 90 degrees. You may not go back in the same direction you just came from. Note: To abort a polyline clip box while creating it, hit the escape key. Defining Regions on a Full Page You may define multiple text regions within Migraph OCR. The number of regions you have currently defined is displayed in the Clip menu. When a full page image is loaded, you will be able to define the text regions for the whole page before processing it. Because Migraph OCR processes text regions in the order they are defined/numbered, remember to select them in the order you wish to have them appear in the final text file or to - Page 30 - resort them to prior processing. Otherwise, the text file will not be in the same order as the original document and you will need to edit the text file. ALTERING A CLIP BOX Once a clip box has been created you may resize, move, or delete it. To alter a clip box, follow these directions: [1] Click on the Point Mode icon. [2] Select a clip box by clicking on it. To deselect it, click on it again. A selected clip box will display polymarkers (small boxes) around its border. Note: A clip box must be selected before you can size or move it. Sizing the Clip Box To size the clip box, move the mouse over a polymarker, and drag the mouse. Whne the clip box is the correct size, release the left mouse button. The clip box will grow or shrink depending on the direction you moved the mouse. - Page 31 - When sizing a text clip box, if it should overlap another defined text region, the two clip boxes will combine and form one region. Depending on the page layout, this may not be desirable. An undesirable instance is when two full length vertical text columns are defined and they join to become one. Because Migraph OCR first segments the text by lines, it will read the top line of the first column and then the top line of the second column, and so on. This is how the text will be displayed in the processing dialog box and the final text file. Editing would then take place in a word processor. Combining Clip Boxes Rectangular clip boxes can be used to define irregular regions. You may use several overlapping rectangular boxes to quickly define a complex region. In fact, it is often quicker and easier than using a polyline clip box. To define an irregular area using rectangular clip boxes, follow these directions: [1] Select the clip box tool and define the first region. [2] Still using the clip box tool, define the remaining regions, making sure to overlap them where needed. The clip boxes will join automatically. - Page 32 - Moving a Clip Box To move a clipbox, place the mouse inside a selected clip box, then drag the mouse. A hand appears in the box while you are moving it. Move the box to the new location and release the left mouse button. Deleting a Clip Box To delete a clip box, first select the clip box, then either choose Delete from the Clip menu or use the Delete key. After deleting a clip box, use the Show Oredr command to make sure the text regions are in the correct order for OCR processing. If they are not, use the Sort Order command to resort them. Undo The Undo command located in the Clip menu will undo the previous clip box command. Undo can also be used to undo an undo (for those times when you reversed a command and then decided you didn't really want it reversed). SORTING TEXT REGIONS Migraph OCR processes multiple text regions in the order they were defined. Because of this, there will be times when text regions need to be sorted, otherwise, the text in the Ascii file will appear in a different order than the original document. For example, Sort Order would be used when a text region has been deleted *interferring with the order), or when the regions were defined out of order. The Sort Order command, found under the Clip menu, allows you to define (or redefine) the order of the text regions for OCR processing. The Sort Order command is generally used for full page images where more than one or two text regions can be defined. Hand scanner owners should never really have need of this feature. - Page 33 - Note: If you have defined the text regions in the correct order, there is no need to use the Sort Order command. To sort defined text regions, follow these drections: [1] Select Sort Order from the Clip menu. The view will change to Zoom full, and the existsing order will be removed. All text boxes will be displayed and any graphic region hidden. The Show Order command is automatically activated so the order is displayed on the screen. [2] Select text regions one at a time, starting with the region you wish to have processed first. From the second region n, the order will be displayed. A line will be drawn from region to region in the order in which they were selected. Each region is assigned a processing number which appears above the upper left hand corner of the region. [3] When you have selected all but the last text region, the new order will be completed and you wil be out of the Sort Order command. Should you decide to change the order, simply select the Sort Order command and start over. To remove the display after sorting, deselect the Show Order command under the View menu. Note: To abort the Sort Order command, press the Escape key. If you wish to abort, you must hit escape before selecting the next to last region. Note: Text layouts with only one column of text do not need to be sorted, whether scanned by a flatbed or hand held scanner. - Page 34 - OCR PROCESSING After loading an image file into Migraph OCR (whether by scanning or loading an existsing file), and selecting the different text regions, you are now ready to run OCR on the image. This chapter explains how the OCR process in this program works, and how to use the Interactive Learning mode. AUTOMATIC OCR Two types of OCR processing are available in Migraph OCR: Automatic and Interactive Learning. Automatic OCR processes the file without any interaction by you. It is quicker than Interactive Learning because it does not require your interaction on questionable characters, but no learning or correct to doubtful characters takes place. When in Automatic mode, the program replaces unrecognized characters with its best guess. If it has no guess, for example when two characters are attached, it replaces the unrecognized character(s) with an @ symbol. Any corrections to the file would then take place in your word processor or desktop publishing program. To use Automatic mode, enter the Control Panel and turn Interactive Learning "off" by deselecting it. Note: Automatic mode would only be used when the current user dictionary has been trained on the same font styles as the current image. - Page 35 - INTERACTIVE LEARNING Interactive Learning is most often the better choice for OCR processing. While it is the slower of the two methods, it provides you with the opportunity to identify unrecognized characters. To use Interactive Learning, enter the Control Panel and click on the Interactive Learning box. Note that it is the default, so if you haven't changed the setting to Automatic, you need not change anything. Processing a File When OCR is performed on an image, whether Automatic or Interactive, the program goes through multiple decision making phases behind the scenes before presenting its solutions to you. It will run through the file several times trying to identify the characters from those in its main dictionary as well as any dictionary you have specified. It will also use the lexicons to help idetify unrecognized characters and to formulate the best possible guess as to the identity of each character. Once this phase is over, the Interactive Learning dialog box will appear and present the unrecognized characters to you. The Interactive Learning dialog box looks like this: - Page 36 - BOX #1 displays the text that has been processed (i.e. recognized). When Migraph OCR comes across a character it cannot recognize, the word that contains the character will be inverted. The program will now stop and wait for action from you. BOX #2 displays the word which contains the unrecognized character. The unrecognized character will be in bold on monochrome systems, or the opposite color on color systems. BOX #3 displays the program's best guess at the correct identity of the character. If a single letter guess is incorrect, you can simply type the correct character. If there should actually be several characters, use the backspace key to delete the current character and the necessary amount of leading spaces and then enter the correct character(s). After a correct guess or a correction, you will use one of the buttons below to specify an action regarding the character. Each of the buttons offers you a different choice as to how to handle the character in question. Here is an explaination of the choices. TRAIN: Use this option to train the program on new characters. The program will form a mathematical definition for the character and store it in the specified user defined dictionary. DO NOT use this option for defaced or ill formed characters that you would not want in the dictionary. Note: When the box next to the Accept (or Train) button is selected (black), it is the default button. As the default button, you may use the Return key instead of clicking on the button. To change the default, click on the other button. ACCEPT: Use Accept when you wish to accept/correct the character in box #3 and have that character placed in the file, but NOT in the dictionary. The program will not be trained on the accepted character. CONFIRM: In some fonts, the letter i and the number 1 are identical, as are the letter o and the number 0. When this is the case, you can use the confirm key to correctly specify which is which. Thereafter, the program - Page 37 - will not present these characters to you for identification. UNDO: Undo allows you to go back and change choices you made regarding the last fifteen (15) unrecognized characters. DELETE: Use Delete to delete unwanted characters, for example dirt specks that were picked up by the scanner. AUTO: This option processes the file or remainder of the file without interactive learning. The program replaces unrecognized characters in the text file with its best guess. If it cannot make a guess (such as when characters are joined) it will replace the unrecognized character(s) with the @ sign. STOP: Use the Stop button when you wish to stop processing before the file is done. The program will save whatever text has been processed and return you to the previous screen. To view/edit the final text file after processing, load the text file into a text editor, word processor, or DTP program. Hint: Using a text editor or word processor which operates as a desk accessory will save time when you wish to view the file. Note: Additional information regarding training can be found in the chapter on the Control Panel under Dictionary. - Page 38 - CONTROL PANEL The Control Panel is where you will select all the settings that govern image input and output, file types, document specifications, and more. This chapter describes the features of the Control panel in depth and how to use them. The Control Panel is brought up by either clicking on the tool icon or by selecting Control Panel from the process menu. The Control Panel looks like this: The Control Panel is divided into four sections: Input, Document, Output, and Cache. - Page 39 - INPUT The Input section specifies whether the image (input) will be directly from a scanner or from a previously scanned image file. Using an Existing Image File When you wish to load an existing image file, select File by clicking on it. The File Mask button allows you to select the file type you wish to load, and displays only that file format in the File Selector. The currently selected file type is displayed to the right of the button. To change the file type, click on the File Mask button. When the pop-up menu appears, make your choice by clicking on it. Note: You may load either TIFF or IMG (Atari) or IFF (Amiga) files from the File menu, regardless of the File Mask set in the control Panel. Direct Scanning If you have one of the campatible hand scanners (and enough memory) we strongly recommend that you scan from within Migraph OCR. To do this, select Scanner. Before the first time you scan in a session, the program expects you to set certain scanner parameters. To do this, click on the Settings button. The Scan Area dialog box will appear. - Page 40 - Within this box you will set the dots per inch (dpi), the scan length, the unit of measurement, and whether you wish to scan vertically or horizontally. DPI - To select a dpi, click on your choice, making sure it corresponds to the dpi set on the scanner head. The default is 300 dpi. SCAN LENGTH - The scan length value is displayed in the upper left corner of the length section. This value is based upon the amount of memory avaialble at the time Migraph OCR starts up. This value can only be changed by using the slider bar. You cannot edit the value itself. To setthe scan length, you may move either the white slider bar, click on the arrows, or click in the grey area. If you click in the grey area, one click moves the slider by .1 cm, or .1 inch, depending on which unit of measurement you have selected. UNIT OF MEASUREMENT - To select a unit of measurement, click on your choice, either inches or centimeters. The value of the scan length will correspond to the selected unit of measurement. SCANNER ORIENTATION - Migraph OCR allows you to scan both vertically or horizontally. The default is portrait (vertical). To scan horizontally, click on the landscape scanner icon. When you scan an image horizontally, Migraph OCR automatically rotates the scan so that it appears vertically on your screen. After making your choices, click OK to return to the Control Panel. Note: When scanning horizontally, scan from left to right. DOCUMENT The Document section provides Migraph OCR with specific information regarding the document you wish to process. You will specify the language, the user dictionary you wish to use, text size, text pitch and whether you wish to use Interactive Learning or not. - Page 41 - Language Migraph OCR includes lexicons for English, French, German, and Dutch languages, with English being the default. The current language is displayed to the right of the Language button. To select a different language, click on the Language button. When the pop-up menu appears, make your choice by clicking on it. Dictionaries Migraph OCR allows you to create user defined dictionaries in addition to the built-in library. In this case, dictionary does not refer to spelling, but instead to a collection of stored character definitions. Any characters you wish to train the program on will be stored in the user defined dictionaries. Before processing a document, you may choose to create a new user dictionary, append an existing user dictionary, or use only the built-in dictionary. User dictionaries can be quite handy, especially when you have a multipage document that may have special characters like math symbols or Greek characters. Differenrt dictionaries can be created for specific types of documents. NEW - Select New when you wish to start a new user dictionary. Click on the Dictionary button. The File Selector will appear. Enter the name for the new dictionary and click OK. Dictionaries have the file extension .ALD. Shoudl you forget to add the file extension, Migraph OCR will automatically add it for you. APPEND - Select Append when you wish to use the existing characters in a user dictionary as well as add new caharcters. READ - Select Read when you wish to have the program use the existing characters in a user dictionary, but not learn any new ones. When adding new characters to a user dictionary, make sure they are char- - Page 42 - acters you will want to keep. We recommend that you do not train the program to recognize defaced or damaged characters, as they will be used infrequently and take up space in the dictionary file. The exception to this is when you have a large document or several documents that are of poor quality and will not scan well. In this case, you may want to have a special dictionary that contains defaced or irregular characters that will be used when processing these documents. WE STRONGLY RECOMMEND THAT YOU CREATE DIFFERENT USER DEFINED DICTIONARIES FOR DIFFERENT STYLES OF DOCUMENTS. For instance, you might create one for typewritten documents, one for laser printed documents, one for typeset documents, etc. We have found that the larger the dictionary is, the slower the OCR processing will be. Another reason for having separate dictionaries is that during the interactive learning process, sometims the least used characters in a dictionary will be eliminated in favor of more frequently used characters. Text Document text is classified as either small (under 10 point) or normal (10-18 pint). The default is normal. When you know you will be working with text under 10pt be sure to set the choice to small, otherwise the OCR will not work as accurately as it could. Hint: Be sure to scan small text at 400 dpi. This dpi provides more dots and the scanned characters will therefore be more accurate. Text pitch must also be set. The default is proportional text, or text whose width is determined by its character shape. Fixed text, or monospaced text as it is more commonly known, has characters whose width is identical regardless of character shape. Interactive Learning Interactive Learning is the process of training the program to recognize - Page 43 - new characters. When you wish to use Interactive Learning, select it by clicking on the button to the left. A solid box signifies the choice is active. When Interactive Learning is turned off, Migraph OCR will process images in Automatic mode. In Automatic mode, the program will identify characters as best it can. Characters it cannot recognize, it will replace with either its guess, or the @ sign, if it has no guess. OUTPUT In the Output section you will select the file name and format for saving text and graphics files. Text The Text section determines the file format and style of the text file you'll be saving. Currently the only file format available is ASCII. In the future, additional text formats for specific word processors or DTP programs may be available. NEW - Before processing a file, you need to provide a file name and location where you wish to have the text file saved to. If it is a new file, first select New to designate it as such; then click on the Text Out button. The File Selector will appear. Enter the path and file name that you wish to use. The file extension should be .TXT. Warning: If you forget, and do not specify a file name before processing, Migraph OCR will save the text file to the file named in the Control Panel, overwriting the existing information. APPEND - There will be time when you'll want to add additional text to an - Page 44 - existing file, for instance, when you process a multipage document, or when you've scanned a full page in two or three passes using a hand scanner and want both images in one file. In these cases selecting Append will add the new processed text to the end of the current file. Note: If you know that you will be appending several images, rather than selecting New for the file, select Append. When Migraph OCR goes to save the processed text to the named file and does not find it, Migraph OCR will automatically create it for you. PARAGRAPH - When the Ascii file is saved, this option will put a carriage return at the end of each paragraph. We strongly recommend that you use this option for most of your documents. LINES - When the Ascii file is saved, this option retains the formating (column width) of the scanned text by putting a carriage return after each line. Unless you specifically need this feature, we recommend the Paragraph option since the file it produces is much easier to edit. CACHE (Atari Only) - Page 45 - GET IMAGE, OK AND CANCEL At the bottom of the process panel are three buttons: Get Image, OK, and Cancel. OK confirms changes you have made to the settings. CANCEL deletes changes you have made and restores the original settings. GET IMAGE is the choice you select when you want to bring a new image into Migraph ICR, whether it is importing an existing file or scanning a new one. Depending on whether the Input button is set to Sacnner or File, the Get Image button will react differently. When the chosen Input is File and you click on Get Image, the File Selector will appear. Select the name of the file you wish to load and click OK. The file you selected will load and you will be returned to the Migraph OCR screen. You can now proceed to preview and select the text regions for OCR processing. Scanning directly is handled differently. Migraph OCR does not allocate the memory it needs for scanning until the Scan Area dialog box has been brought up. Even if you do not intend to alter the settings, it will appear the first time in a session that you try to scan. Thereafter, it will not appear unless you bring it up to change the settings. When the chosen Input is Scanner and you click on Get Image, the Scan Area dialog box will appear if you have not yet entered it (through the Settings button). Make any changes you desire and click OK. If you have already entered it, an alert box will appear. It states that the current image will be destroyed and offers you the choice to continue or abort. Select OK to scan, or Cancel to quit. When you select OK, the scanning window light will come on and you may begin to scan your image. Further details on direct scanning may be found in the chapter on Scanning. - Page 46 - MENU COMMANDS This section contains a brief description of all the commands found in the drop down menus. Any command that has an icon associated with it will have the icon pictured next to the command. Keyboard shortcuts are normally displayed in the program's dropdown menus next to the appropriate command, however, because keyboard shortcuts differ between versions, they have been eliminated from the menus shown in this chapter. If a menu entry has (..) or (>>) after it in the drop down, a pop-up menu will appear after you select the command. If the menu entry has three dots (...) after it in the menu, a dialog box will appear after you select it. DESK/MI_OCR MIGRAPH OCR INFO - This command brings up a dialog box with the current version number and copyright statements. PALETTE - (Amiga Only) This command brings up a dialog box that allows you to adjust the screen colors by altering their RGB values. This is useful for reducing screen flicker when you want to run in Interlace mode. Instructions for using the Palette tool can be found under Installation in the Welcome chapter. - Page 47 - FILE ---- LOAD FILE... Use this command when you wish to load an existing image file. The File Selector will appear. Enter the name and click OK. If you have a choic of file formats, we recommend that you load IMG (Atari) or IFF (Amiga) files, since they will load much quicker than TIFF files. SAVE REGION... Use this command to save the currently selected graphic region as an IMG (Atari), IFF (Amiga), or TIFF file. The File Selector will appear. Enter the path and name you wish to save the graphic to and click OK. DEFAULT SETTINGS The following commands all pertain to Migraph OCR defaults. LOAD... Use this command to use previously saved default files. The File Selector will appear. Select the default file you wish to use and click OK. - Page 48 - SAVE... Migraph OCR defaults are saved in the file MI_OCR.DEF that is loaded automatically everytime the program is run. Use the Save Default Settings command when you wish to save the current settings as the default for Migraph OCR startup. SAVE AS... Migraph OCR allows you to save different settings from the Control Panel as default settings. To make a new default file other than the one that will be loaded into Migraph OCR automatically, enter the Control Panel, make any desired changes you wish, and click OK. Then use the Save As command. The File Selector will appear. Enter the path and file name for the default file, using the ile extension .DEF. To retreive any previously saved default files, use the Load command. CURRENT... This command brings up a dialog box which displays all the settings you have selected in the Control Panel. It is an easy way to view your selections. Note that this dialog box is for viewing only, as no settings in it may be altered. The Current Settings dialog box looks like this: - Page 49 - QUIT Use this command when you wish to exit Migraph OCR. SCAN ---- START SCAN This command allows you to begin scanning from within Migraph OCR. If you have not already entered the Scan Area dialog box, it will appear. Make any desired changes and click OK. An alert will then appear; choose OK to turn the scanner on, or choose Cancel to stop. SETTINGS... This command brings up the Scan Area dialog box. Use this dialog box to set the scanning DPI, scan length, unit of measurement, and scan direction (vertical or horizontal). - Page 50 - CLIP ---- The commands in this menu act only on selected clip boxes. UNDO Use the Undo command to "undo" the previous clip command. Because this command acts like a toggle, it can also be used to undo an undo. DELETE This command deletes any selected clip boxes. SELECT ALL This command selects all text and graphics regions. SORT ORDER... Use this command to sort the text regions so they are in correct order before OCR processing. This command is meant for full page scans where more than one or two text regions may be defined. For more information on this dialog box, refer to the Preview chapter. - Page 51 - ERASE GRAPHIC Use this command to erase a selected graphic. # TEXT REGIONS This displays the current number of defined text regions. PROCESS ------- CONTROL PANEL Thsi command will bring up the Control Panel dialog box in which you specify the settings for input, document, output, and cache path. OCR... This command initiates the OCR recognition process on the currently selected text region(s). When iNteractive Learning is selected, the Interactive Learning dialog box will appear on the screen after processing. - Page 52 - ROTATE.. Use this command to rotate the current image 90 degrees to the left or right. Use thsi option when you have scanned an image horizontally outside of Migraph OCR and need to rotate it. INVERT Use thsi command to invert the image. This command is useful when you have scanned white text on a colord or black background. FLIP Use this command to flip the image horizontally or vertically. VIEW ---- Zoom Full This command assumes that the program window frame is set to the full screen, and displays the whole image to fit into that area. Note that if your window is set smaller, you will only see part of the image. - Page 53 - 100% Thsi command shows the image at actual size. As much of the image as possible is displayed in the window. 200% This command displays the image at twice its original size. This is helpful when the image you are working with has a lot of small type. SHOW ORDER Use this command to display the order in which the text regions will be processed. Lines will flow from region to region showing the processing path and a processing order number will appear above the upper left hand corner of the text region. LOCATOR This onscreen icon is used to display the total area of the image in relation to the area of the image displayed on the screen. The white box represents the total area of the image, while the black box represents the portion of the image displayed on the screen. You may use the Locator to view other parts of the image. Simply move the mouse to the black square, depress the left mouse button and move the mouse. The new portion of the image will then be displayed. - Page 54 - INDEX Abort 30 Sort Order 34 Append Dictionary 42 Text 24, 44 Ascii Paragraph format 45 Saving 44 Automatic 44 OCR processing 35 Using 35 Cache 45 Changing Scan Length 41 Characters Defaced 42 Unrecognized 35, 44 Clip Box Combined 32 Commands 51 Creating 28 Defining Irregular 32 Delete 33, 51 Deselecting 31 Irregular 32 Joining 32 Moving 31, 33 Polyline 29 Select all 51 Selecting 31 Undo 51 Close Box 10 Control Panel 39, 51 Defaced Characters 42 Default Cache Path 45 Language 42 Default File 49 Default Settings Get 48 Defaults 41, 48 Save 48 Save settings 49 Delete Clip box 33, 51 Dictionary Append 42 Defaced characters 42 Main 42 New 42 Reading 42 Special 42 Using 42 Document Settings 41 DPI Selecting 41 Drop Down Menus 10 Exit 49 - Page 55 - File Mask 40 File Menu 47 Files Default path 40 Loading 40 Scanning 40 Selecting 40 Full Box 10 Get Image 46 Graphic Region Creating 28 Graphics Defining regions 28 Irregular regions 29 Output 44 Icon Pad 10 Image Viewing 27 Images Defining regions 28 Full Page 30 Invert 53 Loading 48 Loading large 45 Rotate 53 Viewing 53 Viewing 100 percent 54 Input 40 Installation 6 Amiga 6 Atari 8 Interactive Learning 35, 52 Dialog box 36, 43 Using 36 Interafec 9 Invert Image 53 Language Default 42 Selecting 41, 42 Languages Supported 4 Lexicons 4, 36 Lines 45 Load Images 48 Locator 10, 54 Manual How to use 2 Memory Alloctaing 46 Menu Commands 47 Control panel 52 Current settings 49 Delete clip box 51 Get defaults 48 Invert image 53 Load file 48 Migraph OCR Info 47 Number regions 52 OCR processing 52 Rotate image 53 Save defaults 48 Save region 48 Save settings 49 Select all clip boxes 51 - Page 56 - Show Order 34, 54 Sort Order 33 Start Scan 50 Undo 51 Viewing 100 percent 54 Viewing 200 percent 54 Zoom Full 53 Monospaced Text 43 Moving Clip box 33 OCR Automatic 35 Defaults 48 Definition 2 Interactive Learning 43 Main steps 3 Omnifont 3 Preview 27 Processing 3, 35, 36, 52 Processing regions 30 Undrestanding 2 Omnifont Definition 3 Opening Screen 9 Bottom icon pad 10 Close box 10 Drop down menus 10 Full box 10 Locator 10 Slider bars 10 Top icon pad 10 Zoom levels 10 output 44 Palette Adjusting 6 Paragraph Format 45 Paragraph Format 45 Pitch 43 Polyline Clip Box Aborting 30 Creating 29 Practice Session 11 Pretrained Fonts Dictionary 42 Pretrained Typefaces 5 Preview 27 Processing 3 Automatic 44 Interactive 43 Proportional Text 43 Quit 49 Read Dictionary 42 Region Saving 48 Regions Defining 28 Defining full page 30 Defining in order 30 Number 52 Rotate Image 41, 53 Save Region 48 - Page 57 - Scan Area Dialog Box 40, 46 Scanner Contrast 27 Scanning Allocating memory 46 Defaults 41 Direct 40 Horizontally 41 Start 50 Supported Scanners 40 Screen Adjusting the palette 6 Settings 39 Control panel 52 Current 49 Document 41 DPI 41 Hard drive caching 45 Scan length 41 Vertical/horizontal 41 Show Order 34, 54 Slider Bars 10 Sort Order 33 Special Characters Dictionary 42 Starting Migraph OCR From the CLI 6 System Requirements 5 Text Appending 24, 44 Defining regions 28 File format 45 Formats 44 Irregular regions 29 Loading 46 Monospaced 43 Number of regions 52 Output 44 Paragraph format 45 Path 44 Picth 43 Preview 27 Proportional 43 Saving new file 44 Scanning 46 Show Order 34 Size 43 Sorting regions 33 Text Region Creating 28 Text Scanning 46 Training 43 Tutorial 11 Typefaces Dictionary 42 Pretrained 5 Viewing Images 27 Zoom Viewing 27 Zoom Full 53 Zoom levels 10 - Page 58 - That's it!